Pegasus, a workflow management system for science automation
Abstract
Modern science often requires the execution of large-scale, multi-stage simulation and data analysis pipelines to enable the study of complex systems. The amount of computation and data involved in these pipelines requires scalable workflow management systems that are able to reliably and efficiently coordinate and automate data movement and task execution on distributed computational resources: campus clusters, national cyberinfrastructures, and commercial and academic clouds. This paper describes the design, development and evolution of the Pegasus Workflow Management System, which maps abstract workflow descriptions onto distributed computing infrastructures. Pegasus has been used for more than twelve years by scientists in a wide variety of domains, including astronomy, seismology, bioinformatics, physics and others. This paper provides an integrated view of the Pegasus system, showing its capabilities that have been developed over time in response to application needs and to the evolution of the scientific computing platforms. The paper describes how Pegasus achieves reliable, scalable workflow execution across a wide variety of computing infrastructures.
Similar resources
HUBzero and Pegasus: integrating scientific workflows into science gateways
In this paper, we described the benefits and the challenges of integrating existing scientific workflow technologies into science gateways. Scientific workflow managers are powerful tools for handling large computational tasks. Domain scientists find it difficult to create new workflows, so many tasks that could benefit from workflow automation are often avoided or performed by hand. Two techno...
Bringing Scientific Workflow to the Masses via Pegasus and HUBzero
Scientific workflow managers are powerful tools for handling large computational tasks. Domain scientists find it difficult to create new workflows, so many tasks that could benefit from workflow automation are often avoided or done by hand. Two technologies have come together to bring the benefits of workflow to the masses. The Pegasus Workflow Management System can manage workflows comprised ...
A Taxonomy on Tools for Scientific Workflow Management System
Scientific workflow management systems (SWFMSs) have been shown to be important to scientific computing and services computing [4][5][6][7], as they provide functionalities such as workflow determination, process coordination, job scheduling and execution, provenance discovery and error resistance. Systems such as Pegasus [11], Taverna [8], Swift [12], Vistrails [10], Kepler [9] have seen wide accepta...
Workflow Management in Cloud Computing
Cloud computing is a paradigm that provides on-demand service resources such as software, hardware, platform, and infrastructure. In a cloud environment, workflow is an emerging technique for future scalable applications. This paper discusses the various tools for generating workflows and compares them on the basis of operating system, databases, architecture and so on. The applicati...
Pegasus and DAGMan From Concept to Execution: Mapping Scientific Workflows onto Today's Cyberinfrastructure
In this chapter we describe an end-to-end workflow management system that enables scientists to describe their large-scale analysis in abstract terms, then maps and executes the workflows in an efficient and reliable manner on distributed resources. We describe Pegasus and DAGMan and various workflow restructuring and optimizations they perform and demonstrate the scalability and reliability of...
Journal title:
- Future Generation Comp. Syst.
Volume 46
Pages -
Publication date: 2015